feat: Add SPSA optimization method (Issue #357) by Paramveersingh-S · Pull Request #1712 · google-deepmind/optax

Paramveersingh-S · 2026-06-23T06:39:52Z

Addresses #357

Description

This PR implements the Simultaneous Perturbation Stochastic Approximation (SPSA) gradient estimator to address the open feature request #357.

Rather than implementing it as a stateful optax optimizer, it is implemented as a composable gradient estimator (optax.contrib.spsa_estimator). This aligns best with JAX's functional paradigm, allowing users to pass the resulting grad_fn directly into any existing optax optimizer (SGD, Adam, etc.) and optax.chain. Standard polynomial schedules for learning rate and perturbation scaling are also provided.
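To make the estimator idea concrete, here is a minimal sketch of an SPSA gradient function over a PyTree of parameters. The helper name `make_spsa_grad`, its signature, and the fixed perturbation scale `c` are assumptions for illustration; the thread does not show the actual signature of optax.contrib.spsa_estimator.

```python
import jax
import jax.numpy as jnp


def make_spsa_grad(loss_fn, c=1e-2):
    """Hypothetical helper, not the PR's API: builds grad_fn(params, key)."""

    def grad_fn(params, key):
        leaves, treedef = jax.tree_util.tree_flatten(params)
        keys = jax.random.split(key, len(leaves))
        # One Rademacher (+1/-1) draw per parameter entry.
        delta = jax.tree_util.tree_unflatten(
            treedef,
            [jax.random.rademacher(k, x.shape, dtype=x.dtype)
             for k, x in zip(keys, leaves)],
        )
        plus = jax.tree.map(lambda p, d: p + c * d, params, delta)
        minus = jax.tree.map(lambda p, d: p - c * d, params, delta)
        # Two loss evaluations, whatever the number of parameters.
        coeff = (loss_fn(plus) - loss_fn(minus)) / (2.0 * c)
        # For +1/-1 entries, dividing by d equals multiplying by d.
        return jax.tree.map(lambda d: coeff * d, delta)

    return grad_fn


def loss_fn(params):
    return jnp.sum((params["w"] - jnp.array([1.0, -2.0])) ** 2) + params["b"] ** 2


params = {"w": jnp.zeros(2), "b": jnp.array(3.0)}
initial_loss = loss_fn(params)
grad_fn = jax.jit(make_spsa_grad(loss_fn))

key = jax.random.PRNGKey(0)
for _ in range(200):
    key, subkey = jax.random.split(key)
    grads = grad_fn(params, subkey)
    # Plain SGD step; grads has the same structure as params.
    params = jax.tree.map(lambda p, g: p - 0.1 * g, params, grads)

final_loss = loss_fn(params)
```

Because the estimate has the same tree structure as the parameters, it can stand in for the output of jax.grad wherever an optax optimizer expects gradients. The manual SGD step is used here only to keep the sketch free of extra dependencies.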

Verification

I have added rigorous unit tests in tests/contrib/spsa_test.py utilizing chex.all_variants:

Unbiasedness Test: Verified that, averaged over 10,000 samples, the SPSA gradient estimate matches the true gradient of a multivariable quadratic function to within the test tolerance.
Optimizer Integration Test: Verified seamless integration with optax.sgd minimizing a noisy objective over 50 steps.
Successfully tested compilation under jax.jit and jax.vmap.
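A check of this kind can be sketched as follows. The quadratic, the evaluation point, the perturbation scale, and the tolerance are all assumptions chosen for illustration, not the values used in tests/contrib/spsa_test.py.

```python
import jax
import jax.numpy as jnp

# Hypothetical quadratic f(x) = 0.5 * x^T A x, whose gradient is A x.
A = jnp.array([[2.0, 0.5], [0.5, 1.0]])
x0 = jnp.array([1.0, -2.0])


def f(x):
    return 0.5 * x @ A @ x


def spsa_estimate(key, x, c=1e-2):
    # Single SPSA draw with a Rademacher (+1/-1) perturbation.
    d = jax.random.rademacher(key, x.shape, dtype=x.dtype)
    coeff = (f(x + c * d) - f(x - c * d)) / (2.0 * c)
    return coeff * d


keys = jax.random.split(jax.random.PRNGKey(0), 10_000)
# vmap over the keys only; x0 is shared by every draw.
estimates = jax.jit(jax.vmap(spsa_estimate, in_axes=(0, None)))(keys, x0)
mean_estimate = estimates.mean(axis=0)
true_grad = jax.grad(f)(x0)
```

Each individual draw can be far from the true gradient; only the mean converges, with a sampling error that shrinks roughly as one over the square root of the number of draws. Any comparison therefore needs an absolute tolerance sized to that error rather than an exact match.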

Note on Author Verification: Since SPSA is a classical algorithm by Spall (1998) and not a recent paper, I did not directly email the author. However, the mathematical unbiasedness tests confirm its correctness.

google-cla · 2026-06-23T06:39:57Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

servusdei2018 · 2026-06-23T19:21:50Z

+def spsa_standard_schedule(
+    init_value: float,
+    decay_rate: float,
+    offset: float = 0.0,


If a user instantiates this schedule with the defaults, the very first step (count=0) will result in a ZeroDivisionError (or yield inf in JAX). Change the default offset to something mathematically stable, or at least enforce that offset > 0 if count starts at 0.
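The failure mode can be reproduced with a small sketch. The formula init_value / (count + offset) ** decay_rate is an assumption inferred from the test values quoted elsewhere in this thread (1.0 / 10.0**0.5 at step 0 and 1.0 / 20.0**0.5 at step 10); the function name below is hypothetical and the actual body of spsa_standard_schedule is not shown.

```python
import jax.numpy as jnp


def polynomial_decay(count, init_value, decay_rate, offset=0.0):
    # Assumed form of the schedule; not copied from the PR.
    count = jnp.asarray(count, dtype=jnp.float32)
    return init_value / (count + offset) ** decay_rate


# offset=0.0 at count=0 divides by zero, which JAX reports as inf.
at_zero_default = polynomial_decay(0, 1.0, 0.5)

# A positive offset keeps the first step finite.
at_zero_offset = polynomial_decay(0, 1.0, 0.5, offset=1.0)
at_zero_offset_ten = polynomial_decay(0, 1.0, 0.5, offset=10.0)
```

With plain Python floats the same expression would raise ZeroDivisionError instead of returning inf, which is why the comment mentions both outcomes.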

servusdei2018 · 2026-06-23T19:22:44Z

+        grad_estimate = jax.tree.map(
+            lambda d: (y_plus - y_minus) / (2.0 * c) * d, delta
+        )


You are recalculating the coefficient for every single leaf in the PyTree. y_plus, y_minus, and c are all scalars. Calculate this scalar coefficient once outside the tree map, then apply only the multiplication inside it.

Do this instead:

scalar_diff = (y_plus - y_minus) / (2.0 * c)
grad_estimate = jax.tree.map(lambda d: scalar_diff * d, delta)

servusdei2018 · 2026-06-23T19:24:03Z

+        # equivalent
+        # to multiplying by delta_i. We multiply for numerical stability.
+        grad_estimate = jax.tree.map(
+            lambda d: (y_plus - y_minus) / (2.0 * c) * d, delta


We need numerical safety here. If c decays to exactly 0 or becomes sufficiently small, this division will explode.

servusdei2018 · 2026-06-23T19:24:58Z

+        self.assertAlmostEqual(val_0, 1.0 / (10.0**0.5))
+        self.assertAlmostEqual(val_10, 1.0 / (20.0**0.5))


Stick to np.testing.assert_allclose
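For illustration, the two assertions might be rewritten as below. The float32 stand-ins for val_0 and val_10 and the choice of rtol are assumptions; the real values come from the schedule under test.

```python
import numpy as np

# Stand-ins for the schedule outputs; float32 mimics JAX's default precision.
val_0 = np.float32(1.0) / np.sqrt(np.float32(10.0))
val_10 = np.float32(1.0) / np.sqrt(np.float32(20.0))

# assert_allclose defaults to rtol=1e-7, which is tight for float32 results,
# so an explicit relative tolerance is passed here.
np.testing.assert_allclose(val_0, 1.0 / (10.0**0.5), rtol=1e-6)
np.testing.assert_allclose(val_10, 1.0 / (20.0**0.5), rtol=1e-6)
```

Unlike assertAlmostEqual, which compares a difference rounded to a fixed number of decimal places, assert_allclose takes explicit relative and absolute tolerances and also works elementwise on arrays.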

Paramveersingh-S · 2026-06-24T04:25:50Z

Thanks for the thorough review, @servusdei2018! I've pushed a new commit addressing all of your points:

Schedule stability: I've updated spsa_standard_schedule to default to offset = 1.0 so that count=0 is mathematically stable right out of the box and avoids yielding inf.
Optimized gradient calculation: The scalar division (y_plus - y_minus) / (2.0 * safe_c) is now hoisted and calculated just once before the jax.tree.map, applying only the multiplication across the PyTree leaves.
Numerical safety: I've protected the division by the perturbation scale using jnp.maximum(c, jnp.finfo(jnp.result_type(c)).eps), so that it never divides by zero as c decays.
Testing Standards: I've updated the assertions in spsa_test.py from assertAlmostEqual to np.testing.assert_allclose.
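The hoisting and clamping described above can be sketched together as follows. The function name `scaled_perturbation` and its arguments are hypothetical; only the jnp.maximum expression and the single hoisted division are taken from the comment.

```python
import jax
import jax.numpy as jnp


def scaled_perturbation(y_plus, y_minus, c, delta):
    # Clamp c to machine epsilon so the divisor is never zero.
    safe_c = jnp.maximum(c, jnp.finfo(jnp.result_type(c)).eps)
    # The scalar coefficient is computed once, outside the tree map.
    coeff = (y_plus - y_minus) / (2.0 * safe_c)
    # Only a multiplication is applied to each leaf.
    return jax.tree.map(lambda d: coeff * d, delta)


delta = {"w": jnp.array([1.0, -1.0]), "b": jnp.array(-1.0)}
# c has decayed all the way to zero in this example.
grad_estimate = scaled_perturbation(1.0, 0.0, jnp.array(0.0), delta)
```

Clamping to eps keeps the result finite, but the coefficient can still be very large when c is that small, so the clamp guards against inf and nan rather than against an oversized update.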

All tests pass locally. Let me know if everything looks good on your end!

Paramveersingh-S force-pushed the main branch 2 times, most recently from d304c4b to 1bc8449, on June 23, 2026 06:51

feat: Add SPSA optimization method (Issue google-deepmind#357)

16b5035

Paramveersingh-S force-pushed the main branch from 1bc8449 to 16b5035 on June 23, 2026 07:10

Paramveer singh added 2 commits June 23, 2026 17:27

Update optax

ab59984

chore: remove scratch files accidentally committed

3c0f0ab

servusdei2018 reviewed Jun 23, 2026


fix: address reviewer feedback for SPSA estimator

7d24300

Paramveersingh-S wants to merge 4 commits into google-deepmind:main from Paramveersingh-S:main

2 participants
